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I. INTRODUCTION 



The increased sophistication of military aircraft and 
the related increased dollar costs of procuring both 
aircraft and pilots places ever greater emphasis on the 
importance of determining a feasible method of reducing 
losses through aircraft accidents. Many research efforts to 
date have dealt with determining the causal factors 
underlying an aircraft accident, then using these causal 
factors, attempting to develop predictive models for 
accident occurence. 

Aircraft accidents have been broadly categorized in 
terms of aircraft design malfunctions, aircraft equipment 
failures, pilot or flight personnel error, and weather as 
primary causes for occurrence of mahor aircraft accidents. 
An accident is designated as a major accident if: 1) loss 
of life is involved; 2) complete loss of an aircraft is 
involved; or 3) substantial damage occurs to any aircraft 
involved. Substantial damage is defined in appendix A of 
OPNAVINST 3750.6 (series) . 

The most common cause of aircraft accident cited has 
been pilot error. Brictson, et. al. (1969) studied a four 
year span of aircraft carrier landing accidents involving 
attack and fighter aircraft. Approximately seventy-eight 
percent of the accidents studied had pilot error as the 
primary causal factor. Bricrson noted that the majority of 
the accidents were of two types, hard landings and 
undershooting the landing area. The small deck carriers 
accounted for seventy percent of the total accidents even 
though the large deck carriers had more activity. 
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Studies conducted for the Royal Air Force by Goorney 
(1965) dealt with the human factors involved in pilot error. 
He determined that pilot fatigue, emotional stress, 
complacency, lack of current flying experience contributed 
to pilot error and, if monitored, could lead to prediction 
of the likelihood of pilot error related accidents. 

There are some analysts who feel that if pilot error is 
a primary cause of accidents then the more proficient pilot 
should make fewer errors. This belief leads to the 
hypothesis that measures of pilot proficiency could be used 
as predictive measures. Keller (1961) hypothesized that 
flight time was positively correlated with pilot 
proficiency. He stated that were a pilot to fly the proper 
amount he would attain a safe proficient ability as a pilot. 
The procedure of how to determine the proper amount of 
flight time necessary to attain proficiency and how the 
number of hours needed would interact with fatigue and 
complacency were not fully explored. 

Collicot, et. al. (1972) compared accident rates of 
single-seat aircraft with those of dual piloted aircraft. 
They noted that if the operations were about egual the dual 
piloted aircraft had fewer accidents per ten thousand flight 
hours than the single piloted versions. Though the authors 
refer throughout their study to pilot proficiency they also 
allude to a possibility of temporary mental overload as a 
critical factor underlying pilot error. 

The determination of pilot error tends therefore to 
expand to include emphasis on temporary mental overload as 
well as pilot proficiency measures. Efforts by Kowalsky, 
et. al. (1974) were made to examine causal factors in high 
pilot error rates. Previous efforts to reduce pilot error 
had concentrated on improving pilot proficiency. Kowalsky 
and his co-researchers used cluster analysis and pattern 
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recognition techniques and discovered that the single most 
important causal factor was that, for non- training, 
non-midair accidents, pilots were often temporarily 
overloaded and incorrectly evaluated information presented 
during the period of overload. 

Many studies and much effort has been expended in 
accident research. The extensive data base maintained by 
the Naval Safety Center of accident related information 
opens doors for further statistical analysis of accident 
rates with goals of constructing useful predictive 
mathematical models. 

Myers (1974) hypothesized that measures of pilot 
proficiency and experience available in data collection 
banks would be sufficient to construct a predictive model. 
He used statistical techniques of principle component 
analysis applied to two groups of fifty pilots. One group, 
pilots who had been involved in aircraft accidents, the 
second, pilots with no accidents. The results were not as 
good as was desired, possibly due to the limited sample 
sizes employed. 

A second approach was used by Stuck! and Maxwell (1975) 
who used the techniques of regression analysis applied to 
data on ever two thousand aircraft accidents as collected by 
the Naval Safety Center. Their efforts dealt with pilot 
proficiency variables, aircraft variables and type of flight 
information. They then applied regression analysis to the 
composite group of all accident involved aircraft in the 
Navy's inventory. This effort yielded a predictive equation 
composed of four pilot related variables to predict 
variations in aircraft accident rates. 

Work by Eobino (1974) reported fluctuation in aircraft 
by months with the month of March significantly higher. 
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Subsequent efforts in this area by Poodk (1976) failed to 
support the March phenomena and go on to demonstrate that 
fluctuations in aircraft accident rates by month is purely 
random. 

The author of this study believes that the premise 
promoted by Myer, Stucki and Maxwell and others is valid. 
There should be sufficient data available on current 
aircraft accidents to conduct detailed statistical analysis 
with the resultant predictive equations both meaningful and 
useful. The variable nature of aircraft accident rates 
suggest that the underlying factors may be definable and, if 
they can be determined, used in accident prevention. 

If statistical analysis of aircraft accident rates can 
provide information on accident related variables, be they 
pilot-oriented, aircraft oriented or related to some other 
source, which vary either directly or inversely with 
aircraft accident rates then preventative actions can be 
taken to suppress the enormous costs in dollars and human 
life associated with aircraft accidents. 
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II. NATURE OF THE PROB LEM 



Monthly accident rates exhibit a marked variability when 
each calendar month is compared to other months. The belief 
that some months are consistently higher that others has 
been noted frequently in studies. This phenomena has been 
noted in studies of U. S. Air Force accident rates by Zeller 
and March (1973) and by Robino (1972) in a study of Navy 
aircraft accident rates. Recent work by Poock (1976) at the 
U. S. Naval Postgraduate School displays no statistical 
basis for any month being consistently high and attributes 
the fluctuations to random effects of the underlying causal 
factors. 

The accident rate is defined as the total number of 
accidents in a given month times ten thousand hours divided 
by the total number of flight hours flown that month. 

The efforts of this study, motivated by work of Stucki 
and Maxwell (1975) , were to explore accident rate dependence 
on time related variables by specific aircraft types where 
possible and composites of aircraft types where necessary. 
The results desired are a series of predictive equations 
unique to a specific aircraft or a community. It is 
believed that if in fact aircraft type has no large effect 
on accident rates that the data will yield similar equations 
for each type considered. 
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III. ANALYTICAL PROCEDURES 



This chapter contains the data selection procedure, the 
techniques employed in data preparation, a description of 
the analysis procedures and a summary of decision criterion 
employed in selecting the best equation for predicting the 
variance in the dependent variable rate. 



A. DATA SOURCE 



All Navy and Marine aircraft accidents and incidents are 
reported in detail to the Naval Safety Center, NAS, Norfolk, 
Va. The reporting criteria is detailed in Navy Aircraft 
Accident, Incident and Ground Reporting Procedures 
(OPNAVINST 3760. 6 (series) ) . As Naval Safety Center is a 
repository for all data recorded on aircraft accidents they 
are the source of data used in this report. 



B. DATA SELECTION 



As the goal of this study is to apply the concept 
envisioned by Stucki and Maxwell (1975) to individual type 
aircraft where possible and to group type of ai rcraft where 
necessary, the same basic data set as provided Stucki and 
Maxwell by the Naval Safety Center was employed. 

Table 1 lists the data initially requested from and 
provided by Naval Safety Center. 
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TABLE 1 



DATA SET REQUESTED FROH NAVAL SAFETY CENTER 



Data concerning the pilot; 



1 . Age 

2. Injuries 

3. Number of previous service tours 

4. Total flying time in aircraft model in which 
accident occurred 

5. Total flight hours in previous ninety days 

6. Total nighttime flight hours in previous ninety days 

7. Total daylight carrier landings in previous thirty 
days 

8. Total night carrier landings in previous thirty days 

9. Number of years as designated Naval Aviator 



Data concerning aircraft: 



1. Model 

2. Damage 

3. Number of tours between major aircraft rework 

4. Type of last major inspection 

5. Hours since last inspection 

6. Identification of the system or component failure 



Data concerning the flight; 



1 . Major command 

2. Reporting custodian 

3. Ships's Hull number (if applicable) 

4. Marine Air Wing (if applicable) 

5. Location 

6. Flight Purpose Code 

7. Type of operation code 

8. Phase of operation in which the accident occurred 



Data concerning the accident: 



1. Accident identification number including calendar 
date 

2. Other aircraft damaged 

3. Other personnel injured 

4. Contributing causal factors 

5. Special data not otherwise listed 

6. Weather 

7. Accident rate for the month in which the accident 
occurred 



From the available data set ten basic variables were 
selected in cooperation with Naval Safety Center personnel 
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for inclusion in this study (see Table 2) . 

In general, multiple regression requires that variables 
are measured on interval or ratio scale and that the 
relationships among the variables are linear and additative. 

The data set provided consisted of information on 
Two-Thousand-One-Hundred-Ten accidents or incidents which 
occurred during Fiscal Tears 1969 to 1974, inclusive. 
Selection of a suitable time span was based upon the 
following basic constraint considerations: 

(1) The necessity to have as large a sample size as 
possible to enhance validity of statistical inferences. 

(2) The desire to restrict the years in the sample to 
periods when the aircraft inventory was reasonably 
consistent. 

(3) The necessity to consider gaps and inconsistencies 
in desired data points due to changes and/or modifications 
in data collection requirements and recording procedures 
that had occurred within the Accident Reporting System. 

To accomplish these major considerations the entire data 
base was initially included. Then each variable was 
examined throughout the entire data set in terms of how 
changes in the reporting procedure or inconsistencies in the 
data would effect that variable. This treatment resulted in 
the reduction of the data base to the 
Five-Hundred-Sixty-Nine accidents occurring during the three 
year period FT 72-74 inclusive. 

During this period the inventory of aircraft with which 
this study deals was reasonably constant. 

1 

While this treatment did create a complete data base 
wherein all information desired was available for all 
accidents the resulting size does hamper the investigation 
of aircraft types in cases where the inventory is small to 
begin with and/or where there are few accidents as in the 
A-3 community. This leads in some instances to grouping 
data under categories such as Propeller, Attack or Helos. 
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C. VARIABLE SELECTION 

The ten basic variables selected for inclusion are shown 
in Table 2. 



TABLE 2 



DATA SET INCLUDED IN CURRENT STUDY 



1. Accident rate by month (RATE) 

2. Pilots age (AGE) 

3. Total flight time in accident involved aircraft 
model (T^IME) 

4. Total flight time during ninety days preceding 
accident (TOT90) 

5. Total night flight time during the preceeding ninety 
nights (NITE90) 

6. Daylight carrier landings during the preceeding 
thirty days (CLDAY) 

7. Night carrier landings during the preceeding thirty 
nights (CLNITE) 

8. Number of aircraft tours (ACTOUR) 

9. Aircraft flight hours since last major or calender 
inspection (ACHRS) 

10. Number of years designated Naval Aviator (DNA) 



In addition to the basic variables the author used an 
eleventh variable# DAY90 = TOT90 - NITE90, which is the 
total daylight flight time in the preceeding ninety days. 

Pilots age and years designated Naval Aviator were 
included as they are variables that are historically used as 
indicators of maturity and perhaps proficiency. If, as the 
author believes, the hypothesis that the older pilots tend 
to be safer pilots through a finer sense of judgement of 
risks involved is a valid hypothesis, the author would 
expect a negative simple correlation between AGE and DNA 
with rate. However because the number of other confounding 
factors is great a negative correlation would not justify 
the acceptance of the hypothesis. 

The variables consisting of pilot flight hours and 
carrier landings are considered to be measures of pilot 
currency and proficiency by many in the Navy and are 
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therefore included. 

Aircraft tours is included as a measure of the general 
condition of the aircraft and as an indication of aircraft 
age. Each aircraft in the Navy's inventory undergoes a 
Periodic Aircraft Rework (PAR) for analysis# repair and 
conversion at intervals unique to the model aircraft after a 
specific number of flight hours. This variable also serves 
to monitor any reliability anomalies other than 
"new-bet ter-than-used" as mentioned by Butterworth, et.al. 
(1974) . 

Aircraft hours is included as a measure of aircraft 
condition and usage since major inspection# primarily the 
calendar inspections. 
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D. DATA PREPARATION 

The basic assumptions for multiple regression analysis 
require that data be measured in at least interval or ratio 
scale and that the relationship among the variables be 
linear and additive. 

All data points used were ajudged to be measured on an 
interval scale. Raw data for each type or group of aircraft 
was averaged by months for each of the thirty-six months 
included in the data set where there was an accident for 
that type aircraft. 



E. THE ANALYSIS TECHNIQUE 

The analysis procedure employed was Multiple Regression 
using Forward (stepwise) Inclusion. The statistical Package 
for the Social Sciences (SPSS) compiled and edited by Nie# 
et.al. (1955) includes a forward stepwise multiple regression 
computer program package developed by Jae-On Kim and Frank 
J. Kohout at the University of Iowa. 
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This package was selected as the means of conducting the 
statistical analysis of the data sets. 

Kim and Kohout state that forward stepwise multiple 
regression is a recognized technique: "(1) to find the best 
linear prediction equation and evaluate its prediction 
accuracy; (2) to control for other confounding factors in 
order to evaluate the contribution of a specific variable or 
set of variables; and (3) to find structural relations and 
provide explanations for seemingly complex multivariate 
relationships, such as is done in path analysis.” 

The computer program provides the user with various 
options for treatment of data sets, calculation of 
statistics and output formats. The procedure, Listwise 
Deletion of Hissing Data, is the default option and the most 
conservative in that it maintains sample size and is the 
most accurate. Since the data base finally arrived at was 
complete, no data was deleted by the computer program 
option. If the data base were missing quite a few 
individual data points this procedure could result in a 
drastic decrease in the sample size. This fact was one of 
the underlying considerations in the data base selection 
criteria. 

The options not selected could, if not used with 
considerable prudence and judgement, result in the 
introduction of large amounts of bias that would be very 
difficult to detect without a very good feel for expected 
experimental results. It is for this reason that the 
Listwise Deletion of Hissing Data option was used for all 
computer runs in the current study. Kim and Kohout state 
that: 

"There are many occasions for which simple linear 
models are inadequate. It may be that (1) the bivariate 
relationship is expected {on the basis of theory) to take 
a specific nonlinear form, (2) the bivariate relationship 
is simply unknown, and the examination of the scatterplots 
suggests clear deviation from linearity or, at least, the 
need for testing the adequacy of the linearity assumption, 
or (3) the combined effects of the independent variables 
are not additive. Some of the ways to handle these types 
of nonlinear situations are (1) to transform the original 
variables in such a way that the resultant relationships 
among the transformed variables become linear, (2) to find 
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a simple nonlinear form through the use of polynomial 
regression, and (3) to introduce interaction terms as new 
variables. ” 

There are two extreme viewpoints in regression 
analysis, with valid arguments to support both cases. 
Draper and Smith (1966) explain that the two opposing 
viewpoints are "(1) to make the prediction equation valid 
you should include as many predictor variables as possible; 
and (2) because of increased cost of obtaining variables and 
monitoring them, the equations should include as few 
variables as possible. 11 

The process of selecting the best regression equation 
is the process of compromising between these two extreme 
viewpoints. There is no unique statistical procedure for 
choosing the 'best' equation and large amounts of personal 
judgement are required. In this regard the techniques of 
regression analysis become an art as well as a science. 

Initial analysis of data from the current study 
displayed indications of nonlinearity and interactive 
effects between independent variables. 

To deal with these effects no single set of regression 
variables were deemed 'best' but rather a series of seven 
different regression variable packages were constructed. 
Each data set was run with all seven different packages. 

i 

Variables eight and nine from Table 2 were deemed 
aircraft oriented variables while the remaining basic 
variables were considered human oriented measures. 

Regression I consisted of only those basic variables 
that were human related. Regression II contained all the 
basic variables pilot -or aircraft oriented. Regression III 
used the pilot oriented basic variables plus transformations 
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consisting of the square and the square root of each basic 
variable used. Regression IV includes all of Regression III 
plus the square of, the square root of and the two aircraft 
related basic variables. Regression V contains all of 
Regression III plus the twenty-eight possible cross-products 
of the eight basic variables. Regression VI contains all of 
Regression V plus the aircraft related basic variables, 
their squares and square roots and the cross product of 
Aircraft Tours and Aircraft Hours. Regression VII contains 
the ten basic independent variables, their squares, square 
roots and the forty-five possible cross-products. 

The decision to employ squares and square roots was 
made to provide a larger number of variables capable of 
accounting for curvilinearty . The introduction of 
cross-products allows for interactive effects of independent 
variables. Ihe use of the Forward (stepwise) Inclusion 
Multiple Regression computer program facilitates the 
creation and inclusion of many various transforms. The 
packages used in the study were considered the most 
versatile of the trial packages used in preliminary studies 
by the author. 



F. DECISION CRITERIA 

V 



The forward stepwise multiple regression program 
contains pr eselectable stopping criteria that were adjusted 
to facilitate introduction of variables into the equation 
that by themselves made a significant contribution to 
explaining the variance in the dependent variable RATE. As 
a 'rule-of-thumb' in predictive equation selection the study 
attempts to restrict the number of variables in each 
equation to five. This decision is based upon the degrees 
of freedoii in the regression equation and the need to 
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maintain a significant ratio to provide a solid statistical 
base for conclusions. 

With the degree of freedom restrictions attained the 
primary decision criteria are: 

1) The equation with a significance level of 
100 (1-alfa) percent greater than or equal to ninty-five 
percent, and 

2) That equation that accounted for the largest amount 
of variance in the dependent variable. 

In those cases where the choice of the 'best' equation 
was not clearly indicated other more subjective measures 
were employed, such as, examination of scatterplots of the 
standardized residual versus the standardized predicted 
dependent variable, the plot of the standardized residuals, 
and consideration of the intuitive impact of the particular 
variables in the equations under consideration. 

For example, all other decision criteria being 
statistically equal the equation containing CLDAY - (CLDAI) 2 
+ (RTCLDAY) would be selected over the equation containing 
(AGE) (ACT00RS) + (TTIME) (ACHRS) . 

Since many of the regression packages were very similar 
some cases could yield the same equations for more than one 
regression package, while other cases could yield no 
significant equation for any regression package. In either 
case for completeness the 'best' equations is indicated in 
the results even if that equation is not statistically 
significant. 

This chapter has described the analysis procedures used 
in the development of predictive equations for variance in 
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accident rates by month. The next chapter contains the 
results of the analysis by aircraft community and aircraft 
type where possible. 
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IV. R ESUL TS 



The results by aircraft type or aircraft community are 
contained in this chapter. The best predictive equation 
provided by the seven Regression Packages are shown. 



A. ATTACK AIRCRAFT 



The aircraft of the attack community were divided into a 
composite regression and three separate, regressions . The 
composite consisted of accident involved aircraft of the 
types A-3, A-4, A-5, A-6, and A-7. All variants of each 
type aircraft were included (for example, KA-3, EA-3 and 
A-3). The three separate regressions were conducted on A-4, 
A-6, and A-7 respectively. 



1 ♦ l££iick Composite 



This category was used to investigate trends unique 
to the Attack community and not peculiar to a specific 
attack type aircraft. Additionally, the relatively small 
size of the A-3 and A-5 communities precluded an independent 
analysis of these aircraft. Rather than omit A-3's and 
A-5's they were included here. The accident involved 
aircraft in this category provided the maximum sample size 
of thirty-six data points for analysis. As the number of 
aircraft included of each type are not equal the category 
may be biased towards the larger A-4 and A-7 communities, 
however, the author felt that this would in no way endanger 
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the results as the Attack community is being considered as a 
community. While data was available for A-1's they were not 
included for two reasons, firstly, it was desired to limit 
the category to jet aircraft and secondly, the A-1 has been 
phased out of the active aircraft inventory. 

The basic variables in the regression accounted for 
less than twenty percent of the variance in accidents rate, 
however, the regressions consisting of transformed variables 
yielded two equations of approximately equal quality which 
are: 

A. Bate ( ATTACK) = 0.98593 - 0.01221 (ACTOOR) 2 + 
0.00232 (ACHES) - 0.00226 (CLDAT) 2 + 0.62245 (BTCLNITE) - 
0.00193 (NITE90) 2 



and. 



B. Bate (ATTACK) = 0.79467 - 0.00268 (CLDAY) 2 + 

0.68955 (BTCLNITE) - 0.00664 (NITE90) 2 + 0.15186 (NITE90) - 
0.09602 (BTTOT90) . 



Equation A and equation B are both significant at 
the 99 percent level and equation B accounts for 59.74 
percent of variance in rate contrasted to 51.11 percent for 
equation A. The author's decision criteria were met by both 
equations, however, examination of residual plots favored 
equation A by a narrow margin. The Forward (stepwise) 
Regression criteria selects variables for inclusion in the 
predictive eguation by adding the variable that accounts for 
the largest increase in the percent of variance in the 
dependent variable. It is of interest to note that the 
aircraft oriented variables entered the predictive equation 
first in Equation A followed by the pilot oriented 
variables. With the deletion of aircraft variables the 
second equation provided by only pilot oriented variables 
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accounted for an additional 8.63 percent of variance in rate 
but with a lesser initial effect of the first two variables 
added . 



It can be observed that equation A contains two 
aircraft related variables while equation B is composed 
entirely of pilot oriented variables of which three are 
included in both equations. The predictive equation (B) 
contains the variable NITE90 in two functional forms. The 
net effect on the dependent variable rate is positive for 
values of NITE90 less than or equal to 22.87 hours. For 
hours greater than 22.87 the effect is to reduce the 
predicted monthly accident rate. 



2 . A- 1 Aircraf t 



This category contains all accident involved A-4 and 
TA-4 aircraft in the three year period studied and provides 
a sample size of thirty-one cases. The regression of the 
basic variables accounted for only 22 percent of variance in 
rate at a significance level of 75 percent. The predictive 
equation considered 'best' was: 

Rate (A-4) = -0.00804 (DNA) 2 + 0. 10473 (RT ACHES) 
-0.00010 (D A Y90) 2 + 0.77584 (RTNITE90) - 0.11160 (NITE90) - 
0.13246. 



This equation, significant at the 95 percent level, 
accounts for 42.68 percent of the variance in rate. 

It is noted that four of the five variables are 
pilot oriented variables three of which are based on hours 
flown. The predictive equation for A-4 aircraft contains 
NITE90 in two functional forms as did the predictive 
equation for the Attach community. Again as in the Attack 
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community the net effect of NITE90 is positive for the lower 
number of hours flown. Particularly the net effect is 
positive for NITE90 less than or equal to forty-eight hours 
and negative for values greater than forty-eight. 



3 . A-6 Aircraft 

This category was restricted to a sample size of 
twenty due to relatively few accidents and a smaller 
community. In order to achieve the largest sample size 
possible the author included EA-6 aircraft with the A-6 and 
KA-6 models. The small sample size tends to make suspect 
any results derived by regression analysis. 

The basic variables accounted for less than ten 
percent of the variance in rate while the 'best' predictive 
equation accounted for 40.27 percent. This equation is 
however significant only at the 75 percent level. The 
author feels that the small sample size tends to negate any 
usefulness of this regression. The equation is included for 
continunity of the study and for discussion purposes. The 
equation is: 



Bate (A-6) = 16.28967 - 0.04604 (DNA) 2 + 2.33592 
(RTDNA) - 20.30561 (BTACTOOR) + 5.34649 (ACTOOB) + 0.05874 
(BTTIME) . 

It is noted here that only three basic variables are 
used in seme functional form with a balance of aircraft and 
pilot oriented variables used. The independent variables 
DNA and ACTOOB are each used in two functional forms. The 
net effect of ACTOOBS is positive for values greater than or 
equal to 15 tours while the net effect of DNA on the 
dependent variable rate is positive for values less than or 
equal to 13.70 and becomes negative for values greater than 
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13.7 years. 



4 . A- 7 Aircraft 

This category provides a sample size of thirty-two 
cases based on all A-7 aircraft models involved in accidents 
during the study period. The regression analysis yielded 
the following predictive equation: 

Rate (A-7) = 0.27170 (RTCLDAY) - 0.01856 (CLNITE) 2 + 
0.21346 (RTNITE90) + 4.01 164 (RTDNA) - 0.88896 (DNA) - 
3.70545. 



This equation, significant at the 99 percent level 
accounts for 55.01 percent of the variance in rate. 

The predictive equation for A-7 aircraft also 
contains two functional forms of DNA. Again here as in the 
A-6 the net effect is negative for large values of DNA, 
greater than 21 years in this case. While the effect on 
rate is positive for values less than 21 years the net 
effect decreases as the value of DNA approachs 21 years. 
This agrees with the intuitive feeling that DNA is a measure 
of pilot proficiency and the more experience the safer the 
pilot. 
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TABLE 3 



ATTACK AIRCRAFT VARIABLE SUMMARY 



VARIABLE 


ATTACK-A ATTACK-B 


A— 4 A- 6 


A- 7 


FREQ 


NITE90 


0.07382 


0.12366 




2 


ACTOOR 




-0.17445 




1 


ACHRS 


0.31152 






1 


DNA 






0. 15431 


1 


DAY902 




-0.12592 




1 


NITE902 


-0.04184 -0.04184 






1/1 


CLDAY2 


-0.31728 -0.31728 






1/1 


CLNITE2 






-0.23345 


1 


ACT0UR2 


-0.37244 






1 


DNA2 




-0.31353 -0.34703 




2 


RTTIME 




-0.03596 




1 


RTTOT90 


0.06426 






1 


RTNITE90 




0.22177 


0.22398 


2 


RTCLDAY 






0.34684 


1 


RTCLNITE 


0. 13914 0. 13914 






1/1 


RTACTOUR 




-0.14537 




1 


RTACHRS 




0.30860 




1 


RTDNA 




-0.24288 


0.22979 


2 



TABLE ENTRIES ARE THE CORRELATION COEFFICIENT OF THE 
DISPLAYED VARIABLE WITH RATE 



Table 3 displays the basic and transformed variables 
as used in the regression package. The suffix '2* indicates 
that variable squared and the prefix 'RT' the square root of 
the variable. Tabled are the correlation coefficients of 
the displayed variables with the dependent variable rate. 

For those cases where the variable appears in more 
than one equation the correlation coefficients are quite 
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consistant with the exception of RTDNA where the coefficient 
for A-»7 is contrary to what would be normally considered 
true. While it is generally believed, the remaining 
aircraft types bear out, that the adage that the older, more 
experienced pilot has fewer accidents, this does not appear 
to hold for the A-7 aircraft. It is noted that the values 
for DNA and RTDNA in Table 3 for A-7 are positively 
correlated, in addition the coefficient for pilot age is 
also positively correlated for A-7. The author is unable 
from his experience to explain this unusual occurance. 



TABLE 4 



ATTACK AIRCRAFT BASIC VARIABLE SUMMARY 



VARIABLE ATTACK-A ATTACK-B A-4 

AGE 

TTIME 

TOT90 1 

DAY90 1 

NITE90 1 2 2 

CLDAY 1 1 

CLNITE 1 1 

DNA 1 

ACTODR 1 

ACHRS 1 1 



A- 6 A-7 

1 

1 

1 

1 

2 2 

2 



FREQ 

0/0 

1/1 

0/1 

1/1 

4/5 

2/2 

2/2 

5/5 

3/2 

2/1 



Table 4 relates the usage of basic variables in each 
category in some functional form. Basic variables AGi, 
TTIME, T0T90, and DAY90 are used one time or less indicating 
that these measures have little or no effect on predicting 
variance in aircraft accident rates. NITE90 and DNA are the 
high usage variables followed by CLDAY, CLNITE, ACTOUR and 
ACHRS with moderate usage. 
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B. FIGHTER AIRCRAFT 



The analysis of fighter aircraft was restricted to F-4 
and a composite of F-4 and F-8 aircraft. The data base did 
not provide enough data to conduct independent analysis of 
F-8’s by themselves which led to the composite category. 

1 . Fighter Composite 

This category was included to provide a method of 
including F-8 aircraft and to facilitate the possible 
contrast of the Attack and Fighter communities. 

The composite analysis yeilded a sample size of 
thirty-six cases primarily on the strength of the F-4 
community. The basic variable regression accounted for less 
than twenty-five percent of the variance in rate at a 
significance level of 95 percent. Once again the regression 
using transformed variables provided a better predictive 
equation as shown below. 



Rate (Fighter) = 1 .21906 (RTACTOUR) + 0.23768 

(RTDAY90) + 0.01897 (CLDAY) 2 ♦ 2.38695 (RTCLDAY) -0.92126 

(CLDAY) - 2.48215. 



This equation accounts for 40.45 percent of the 
variance in rate and is significant at the 99 percent level. 

It is observed that the variable CLDAY appears in 
each of its functional forms and while it does not account 
for the most variance initially in conjunction with the 
forms of ACTOOR and DAY90 it adds about sixteen percent to 
the accounting of variance in rate. The net effect on the 
dependent variable rate of the variable CLDAY is positive 
for values less than or equal to 11.56 daytime carrier 
landings in thirty days. For values greater than 11.56 the 
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net effect becomes negative and will tend to decrease the 
accident rate. 

2 . P-4 Aircraft 

The category of F-4 aircraft consisted of a sample 
size of thirty-six data points. Regression analysis yielded 
the following predictive equation: 

Rate (F-4) = 0.19142 (RTDAY90) - 0.03663 (CLNITE) 2 - 
0.000002 (TTIME) 2 + 0.17982 (RTTIME) -0.01302 (DNA) 2 - 
1.59073. 



This equation accounts for 34.79 percent of variance 
of rate at significance level 95 percent. The equation 
generated by the basic variables alone accounted for less 
than nine percent and were not significant at the 75 percent 
level. 

The predictive equation deemed 'best* was generated 
from Regression III and contained only pilot-oriented 
variables. 

The variable TTIME appears here in two functional 
forms with a positive net effect on rate for values less 
than or equal to 2006 hours. 
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TABLE 5 



FIGHTER AIRCBAFT VARIABLE SUMMARY 



VARIABLE 


FIGHTER 


F-4 


FREQ 


CLDAY 


-0. 16666 




2 


TTIME2 




-0.11285 


1 


CLDAY2 


-0.26346 




1 


CLNITE2 




-0.19546 


1 


DNA2 




-0. 16277 


1 


RTTIME 




0.08688 


1 


RTDAY90 


0.24865 


0. 23319 


2 


RTCLDAY 


0.00643 




1 


RTACTOUR 


0. 44776 




1 



TABLE ENTRIES ARE THE CORRELATION COEFFICIENTS OF THE 
DISPLAYED VARIABLE WITH RATE 



In the case RTDAY90 which appears in both equations 
the correlation coefficients are quite consistent. It is 
noted that the variables normally considered as pilot 
proficiency variables are negatively correlated with rate, 
except in the cases where square roots are used. 
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TABLE 6 



FIGHTEB AIBCBAFT BASIC 7ABIABLE SUMMARY 

V ABIABLE 
AGE 
TTIME 
TOT90 
DAY 90 
NITE90 
CLDAY 
CLNITE 
DNA 

ACTOUB 
ACHES 

Table 6 relates the usage of basic variables in some 
functional form by category. The basic variables CLDAY, 
TTIME, and DAY90 are the high usage variables in this 
category and are all pilot related variables. The variables 
AGE, TOT90, NITE90 and ACHES did not appear in any form in 
the fighter community 



FIGHTEB F-4 FEEQ 

0 

2 2 

0 

1 1 2 

0 

3 3 

1 1 

1 1 

1 1 

0 



C. PBOPELLEB AIBCEAFT 

The aircraft considered in the propeller aircraft 
category consisted of E-1, E-2, C-1, C-2, S-2, P-3, C-117, 
C-118, and C-130. Due to the relatively small size of each 
individual community and the infrequency of accidents it was 
necessary to combine all aircraft into one category entitled 
1 PROPS'. This procedure is somewhat unnerving as there are 
normally aspirated and turboprop aircraft together as well 
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as carrier-based and land-based aircraft. This tends to 
bias the results and applications or inferences cannot be 
directed toward any particular member aircraft in the group. 

The result of aggregating the above aircraft is a sample 
size of twenty-six cases which provides an equation of basic 
variables that accounts for less than thirty percent of the 
variance in rate at a significance level of seventy-five 
percent. 

Regression packages V, VI and VII provided the same 
equation which was judged 'best' by the author. 



Rate (PROPS) = 0.35935 + 0.000Q2[ (TTIHE) (NITE90 ] 

0.00022 (NITE90) 2 - 0.00001 [ (AGE) (TTIHE) ] + 0.00108 

[ (CLDAI) (ENA) ] -0. 00595 (CLNITE) 2. 

The predictive equation accounts for 43.25 percent of 
the variance in rate at a significance level of 95 percent. 
This category provides the only case where cross-products 
contribute to the predictive equation. The equation 
consists of only six basic variables in some functional 
form, and all six are pilot oriented variables. This could 
be indicative of the inherent safety of large multi-engined 
propeller aircraft where a flight may be aborted due to a 
mechanical failure with a lower probability of an accident 
resulting frcm the mechanical failure. 

The inclusion of pilot oriented variables in the area of 
carrier landings casts doubt upon the validity of the 
predictive equation because many of the aircraft are not 
carrier-based and their pilots do not record carrier 
landings. The equation is still considered valid by the 
author in that a value of zero was recorded for those pilots 
with no carrier landings and if the regression technique 
still selects that variable it is due to the correlation 
interactions with that variable and the combined independent 
variables included in the equation with the dependent 
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variable rate 



TABLE 7 

PROPELLER AIRCRAFT VARIABLE SUMMARY 



VARIABLE 

NITE902 

CLNITE2 

CLDDNA 

TTNITE 

AGEDNA 

TABLE ENTRIES ARE THE 
DISPLAYED 



PROPS 
o'. 09764 
-0.22505 
0.21845 
0.55415 
0.19878 

CORRELATION COEFFICIENT OF THE 
VARIABLE WITH RATE 



TABLE 8 

PROPELLER AIRCRAFT BASIC VARIABLE SUMMARY 



VARIABLE 


PROPELLERS 


FREQUENCY 


AGE 


1 


1 


TTIME 


2 


2 


TOT90 


0 


0 


DAY90 


0 


0 


NITE90 


2 


2 


CLDAY 


1 


1 


CLNITE 


1 


1 


DNA 


1 


1 


ACTOUB 


0 


0 


ACHRS 


0 


0 
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Table 7 shows that only one variable, CLNITE2, is 
negatively correlated with rate, while the remaining 
variable forms are all positively correlated. Table 8 
reflects the degree of usage of each basic variable with 
total time and night hours being used twice and the 
remaining basic varaibles one time or not at all. 



D. HELICCPTEBS 

The category helicopters consists of the aggregate of 
H-1, H-2, H-3, H-46 and H-53. This category, like that of 
Propellers did not provide sufficient data to conduct 
independent analysis by type aircraft. The aggregate 
yielded a sample size of thirty-three cases. 

The analysis yielded only two predictive equations for 
the seven regression packages. The equation provided by the 
basic variables was ajudged ‘best' and is: 

Bate (HELO) = 0.00062 (TTIflE) + 0.65405. 

This equation, significant at the ninety percent level 
accounts for 9.30 percent of the variance in rate. The 
second equation provided by the remaining five regressions 
accounted for 11.45 percent of the variance in rate. The 
difference cf 2. 15 percent was not deemed sufficient 
increase to use the second equation which consisted of the 
variable TTIME squared. 
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V. 



DISCUSSION 



It is interesting to note that even though the variable 
AGE was the most significient single variable in the overall 
eguation arrived at in the study by Stucki and Maxwell that 
AGE appeared only once in the current study. The variable 
AGE appeared as a cross product with TTIME in the prediction 
equation for propeller aircraft. The current study employs 
the variable DNA which was not used by Stucki and Maxwell. 
The variable DNA was used in some functional form seven 
times and represents the highest single useage of any basic 
variable. The simple correlation between AGE and DNA is 
quite high in each of the eight categorys. Intuitively this 
implys that one or the other will dominate and both will try 
to account for the same portion of variance in rate 
explainable by this type of variable. A similar trend 
appears in looking at the usage of TOT90, DAI90 and NITE90. 
As the sum of DAI90 and NITE90 equals TOT90 it would be 
expected that one of the variables would dominate the 
predictions. This is in fact the case as NITE90 is used six 
times, DAY 90 three times and T0T90 enters only in the 
alternate best equation for Attack aircraft. The variables 
TTIME and CLDAI are each used six times in some functional 
form while CLNITE appears four times. The aircraft oriented 
variables appear a total of six times in some functional 
form, four times for ACTOUR and twice for ACHES. 

Although some of the Regression Packages yielded the 
same equation within categories. Regression Package IV can 
be credited with the best equation in five out of eight 
categories. Regression IV provided the best equation in the 
four attack aircraft categories and in the category Fighter 
aircraft. Regression Package V provided the Propeller 
equation. Regression II the Helicopter equation and 
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Eegression III the F-4 equation. The predictive equation 
for F-4 is the only case where the best equation was 
provided by a regression package that contained only pilot 
oriented variables. The category consisting of Helicopters 
was the only case where the transformed variables did not 
provide a large improvement over pure basic variables in the 
predictive equation generated. 

The majority of the variables entered in the eight 
predictive equations were of the variable squared or the 
square root of the variable, thirteen and fourteen times 
respectively. Three cross products used six variables and 
the basic variables appeared six times. 

There does not appear to be any trend or tendency for 
any particular basic variable to be consistent over the 
range of the eight categories considered. If the hypothesis 
that the older more experienced pilot is a safer pilot is 
valid and if the variables of AGE and DNA can be considered 
as measures of this hypothesis, then the author would expect 
the simple correlation coefficients of these variables to be 
negative. This is not the case as five of the sixteen 
coefficients are positive. It is possible that due to the 
relatively small size of each sample and the fact that there 
are months where only one accident occured that a 
coefficient could be only slightly positive without 
violating the hypothesis. This does not explain the 
coefficients of the A-7 category (see Table 9) where the 
coefficient of AGE is on the order of 0.26 and DNA is 0.15. 
As stated previously the author is unable to explain this 
phenomena. Similar arguments can be generated for each of 
the ten basic variables. The closest case to being 
consistent in sign is with CLNITE where all coefficients are 
negative except that of A-6 which is 0.07. The value of 
0.07 for A-6 combined with the extremely small sample 
considered in this case (20 data points) leads the author to 
discount any significance in sign for this variable. 
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TABLE 9 





ATTACK 


A-4 


A-6 


A-7 


AGE 


-0.27305 


-0.12430 


-0.30084 


0. 265 25 


TTIME 


-0.05110 


-0.08715 


-0.09278 


0. 17624 


TOT90 


0.02845 


-0.01864 


-0.02922 


-0. 03100 


DAI90 


0.01701 


-0.08675 


0.02350 


-0.07358 


NITE90 


0.07302 


0.12366 


-0.11615 


0. 07386 


CLDAI 


-0.19153 


0.05906 


-0.02135 


0. 21725 


CLNITE 


-0.11847 


-0.01026 


0.07686 


-0. 03701 


ACTOUB 


-0.33037 


r0. 1 1 151 


-0.17445 


-0.16477 


ACHBS 


0.31152 


0.29628 


0.10899 


-0.11530 


DNA 


-0.18404 


-0.25457 


-0.28943 


0. 15431 




PIGHTEB 


F-4 


PROPS 


HELOS 


AGE 


-0.006448 


0.00785 


-0.06703 


0.09091 


TTIME 


0. 15639 


0.01361 


0.21211 


0.30509 


TOT90 


0.11077 


0. 14855 


0.08224 


0.09746 


DAY90 


0.23847 


0.20732 


-0.00069 


0. 12454 


NITE90 


-0.22743 


-0. 14624 


0.16823 


-0. 00036 


CLDAI 


-0. 16666 


0. 10926 


0.20479 


** 


CLNITE 


-0. 161 56 


-0.12479 


-0.22141 


** 


ACTOUB 


0.42709 


0.19403 


0.14963 


-0.07866 


ACHBS 


0.03460 


0.20765 


0.12285 


0.08244 


DNA 


-0.025 18 


-0.05232 


-0.05969 


0.04829 






** no 


carrier landings recorded 


TABLED 


VALUES ABE THE 


! CORRELATION 


COEFFICIENTS OF THE BASIC 




VABIABLES WITH ACCIDENT 


RATE BY CATEGORY 


The 


net effect on 


Rate by the 


combined functional forms 


of a basic variable 


was discussed in the 


results by 


category . 


It should 


be noted, however, that 


adjusting the 


values of 


a basic variable to bring 


about a decrease in the 


accident 


rate could 


potentially 


cause one 


of the other 



measures to shift in such a way that the total net effect 
would be to increase the accident rate. 
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V I . RECOMMENDATIONS 



The current study consisted of analysis of aircraft 
accident rate by type aircraft. In this approach the size 
of the communities and the resulting size of the data base 
are such that the analysis is constrained by the degrees of 
freedom available in regression techniques. As a future 
study the techniques and hypotheses employed in this study 
would be a useful starting point. The necessity of a larger 
data base would be overcome by time as the inventory studied 
is not that different from the current Navy inventory. The 
sensitivity of some of the basic variables employed in this 
study suggest that future studies procure additional data of 
the following types: 

1) In addition to the number of day and night carrier 
landings in the past thirty days, a numerical grade of the 
quality of each landing made should be included. 

2) A breakdown of the hours flown in the preceeding 
ninety days to include, for example, flight hours in past 24 
hours, flight hours in past 72 hours. This would allow 
inclusion of concepts of fatigue versus proficiency. 

3) In addition to AGE and DNA, the number of months in 
operational flying billets and the number of months in 
current tour. 

The data base employed contained much information on 
accidents and allows constructing .a profile of the pilot who 
had an accident. The single most severe hinderance to this 
author in drawing conclusions was the lack of adequate or 
equal knowledge of the pilot who did not have an accident. 
It is recommended that prior to any future studies of this 
type the analyst procure data on accident free pilots with 
as many variables in common with the accident involved pilot 
as feasible. The hinderance to this author was that a 



38 



profile of the accident involved pilot may be identical to a 
profile of the non-accident involved pilot. Without the 
results of this comparison the usefulness of the predictive 
equations for prediction cannot be demonstrated 
statistically. It is possible however that the equations 
could be validated by usinq them to attempt to predict and 
comparinq the actual resulting rates. 

The real benefits of this study are in the analysis of 
the variables that enter the equations and by using the 
frequency of appearance in planning future studies with even 
greater detail in those areas where the variables appear to 
contribute the most. 

While this study is somewhat broad in scope it does 
provide encouragement for future efforts along this line of 
reasoning. The ever increasing necessity to reduce loss in 
human life and dollars due to aircraft accidents provides 
the incentive. 
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APPENDIX A 



AVEHAGE MONTHLY DATA POINT VALDES 



This appendix contains the average monthly data point 
values for the basic variables and the dependent variable. 
Sate. Each aircraft type or community examined in the study 
is recorded in a table. 

The dependent variable Rate is the aircraft accident 
rate per ten thousand hours. Rate is calculated by taking 
the number of aircraft accidents for the month times ten 
thousand hours and dividing by the total number of hours 
flown by that type aircraft for the month. 
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MONTH HATE AGE TTIME TOT90 DAY90 NITE90 CLDAY CLNITE ACTOUB ACHRS DNA 
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